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DETAILED ACTION 

1 . This action is responsive to communications: The RCE filed 02/22/07. 

2. Claims 2 1 -22 have been added as necessitated by Amendment. 

3. The rejection claims 1-3, 6-9, 10-12, and 15-18 are rejected under 35 U.S.C. 103(a) as 
being unpatentable over Russell-Falla et al (US-6,675,162 01/06/04) in view of Chakrabarti et al 
(US-6,389,436 05/14/02) in further view of van den Akker (US-6,415,250 07/02/02)(Hereafter 
Akker) have been withdrawn as necessitated by Amendment. 

4. The rejection of claims 4-5 and 13-14 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over Russell-Falla et al (US: 6,675,162 01/06/04) in view of Chakrabarti et al (US- 
6,389,436 05/14/02) in view of van den Akker (US-6 5 415,250 07/02/02) in further view of Haug 
et al (US: 6,556,964 04/29/03) have been withdrawn as necessitated by Amendment. 

5. Claims 1-22 are pending in the case. Claims 1 and 10 are independent claims. 

Claim Rejections - 35 USC § 103 

6. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set forth in 
section 102 of this title, if the differences between the subject matter sought to be patented and the prior art are 
such that the subject matter as a whole would have been obvious at the time the invention was made to a person 
having ordinary skill in the art to which said subject matter pertains. Patentability shall not be negatived by the 
manner in which the invention was made. 

7. Claims 1-3, 6-12, 15-22 are rejected under 35 U.S.C. 103(a) as being unpatentable over 
Russell-Falla et al (US-6,675,162 01/06/04) in view of Chakrabarti et al (US-6,389,436 
05/14/02) in further view of Shmueli et al (US-6,442,555 08/27/02). 
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-In regard to independent claims 1 and 10, Russell-Falla et al teach a method and 
apparatus for determining content type of a web page comprising: 

providing a predefined set of potential content types exclusive of indicating formal 
language of the content (categories of content)(column 2, lines 35-43); 

for each potential content type (categories of content)(column 2, lines 35-43)(e.g. 
"pornographic", "racist", etc)(column 3, lines 39-43), preparing a distinguishing series of binary 
tests (column 2, lines 56-63; column 3, lines 39-57; column 4, lines 61-66)(i.e. testing each 
keyword or regular expression of the content against a database of keywords and regular 
expressions common to the content type for matching and weighting purposes), wherein at least 
one test determines whether a predefined piece of data or keyword appears in URLs (column 2, 
lines 5-9) of the subject Web page (column 2, lines 56-63; column 3, lines 23-57, column 4, lines 
61-66: i.e. testing all of a web pages textual content which as appreciated by one skilled in the art 
would include the text in the URLs of the selected web page); 

for each potential content type (categories of content)(column 2, lines 35-43)(e.g. 
"pornographic", "racist", etc)(column 3, lines 39-43), running the distinguishing series of tests 
(column 2, lines 56-63; column 3, lines 39-57; column 4, lines 61-66)(i.e. testing each keyword 
or regular expression against a database of keywords and regular expressions common to the 
content type for matching and weighting purposes) enabling quantitative evaluation of some 
contents of the selected web page being of the potential content type (column 2, lines 55-64); 

mathematically combining the test results (column 3, lines 54-57); and 

based on the results, assigning a probability (equivalent to the final rating of the page 
relative to the content category), for each potential content type, that shows the likelihood that 
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some contents of that type exist on the selected web page exclusive of indicating the language in 
which the content was written (column 3, lines 2-6). 

Russell-Falla et al do not teach wherein the distinguishing series of tests includes at least 
one or more non-binary tests. Chakrabarti et al teach a distinguishing series of tests to determine 
the potential content type of a web page (column 4, lines 5-15), wherein the distinguishing series 
of tests include non-binary tests (i.e. the test results in more than two possible outcomes)(column 
6, lines 27-67; column 7, lines 1-4). It would have been obvious to one of ordinary skill in the 
art at the time of the invention for Russell-Falla et al to have run the additional non-binary tests 
as taught in Chakrabarti et al for determining the content type of web pages, because Chakrabarti 
et al teach that by utilizing in/out links of the web page with the hypertext classifier (Fig. 1 : 110) 
the accuracy of classification goes up over those tests utilizing only local text/terms of the 
document to be classified (column 7, lines 34-59). 

Russell-Falla et al do not specifically teach wherein at least one test was examining page 
format or style other than position of data or keyword in the subject web page. Shmueli et al 
teaches a method for classifying a new document as a particular type based oh determining the 
format information within each portion of the document, wherein the format information 
includes font, font size, and justification (column 1, lines 28-47: "font, font size, and 
justification"). It would have been obvious to one of ordinary skill in the art at the time of the 
invention for one of the distinguishing series of tests of Russell-Falla to have analyzed the page 
format or style of the document web page as shown in Shmueli et al, because Shmueli et al teach 
that by utilizing a more robust document decomposition that looked specifically at document 
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format, a document could be automatically recognized and classified (column 1, lines 28-47; 
column 2, lines 13-17). 

-In regard to dependent claims 2, 11, and 15, Russell-Falla et al further teach wherein 
the set of potential content types could include web page articles/news with information about 
people (e.g. pornography, racism, terrorism) and other content (column 2, lines 10-23; column 3, 
lines 41-43). 

-In regard to dependent claims 3 and 12, Russell-Falla et al further teach producing a 
respective confidence level (equivalent to the rating of the page relative to the content category) 
for each potential content type when at least some of the web page content was of that type 
(columns 2 & 3, lines 54-67 & 1-6). 

-In regard to dependent claims 6 and 16, Russell-Falla et al further teach wherein the 
step of running the tests includes determining whether a predefined piece of data or keyword 
("weighting list") appears in the web page (column 2, lines 56-63). 

-In regard to dependent claims 7 and 17, Russell-Falla et al further teach wherein the 
step of running the tests includes determining whether a predefined piece of data or keyword 
("weighting list") appears in the web page (column 2, lines 56-63). 
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-In regard to dependent claims 8, 9, and 18, Russell-Falla et al do not teach storing 
indications of the assigned probabilities (web page ratings) of each potential content type cross 
referenced with each respective web page in a database. It would have been obvious to one of 
ordinary skill in the art at the time of the invention to have stored previously viewed web pages 
along with there respective ratings for content types local to the user, because it was well known 
in the art at the time of the invention that storing frequently view web pages with their ratings 
would significantly reduce the determination/processing time of the Russell-Falla et al system by 
eliminating undue identifying, analyzing, and calculating on identical web page requests. Thus a 
repeated request could render an appropriate web page more efficiently which would benefit 
Russell-Falla et al which teach that analyzing web pages could be difficult and time consuming 
(column 1, 38-43). 

-In regard to dependent claims 19 and 20, Russell-Falla et al teaches wherein one of 
tests was determining the number of phrases that contain the keyword (column 2, lines 56-63; 
column 3, lines 39-57; column 4, lines 61-67; column 5, lines 1-22: i.e. the number of keywords 
from the selected web page that have a corresponding entry in the pre-existing database are 
placed in a weighted list and then summed together to determine a rating to be compared to a 
given threshold). 

-In regard to dependent claims 21 and 22, Russell-Falla teaches in view of the Shmueli 
et al reference wherein one test included examining page format or style other than position of 
data. Because a test for examining syntax or grammar was listed in the alternative the limitations 
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of these claims do not further limit their parent claims via the path selected by the Examiner and 
are thus not considered. 

8. Claim 4-5 and 13-14 are rejected under 35 U.S.C. 103(a) as being unpatentable over 
Russell-Falla et al (US: 6,675,162 01/06/04) in view of Chakrabarti et al (US-6,389,436 
05/14/02) in view of Shmueli et al (US-6,442,555 08/27/02). in further view of Haug et al (US: 
6,556,964 04/29/03). 

-In regard to dependent claims 4 and 13, Russell-Falla et al further teach wherein the 
test results utilize a neural network (column 4, lines 1-5). Russell-Falla et al do not teach 
wherein the combining of the test results includes using a Bayesian network. Haug et al teach 
wherein the application of a Bayesian network for statistical pattern recognition provides 
improved system performance with additional training of the network (column 3, lines 8-16). It 
would have been obvious to one of ordinary skill in the art at the time of the invention, for the 
invention of Russell-Falla et al to have employed a Bayesian network as shown in Haug et al, to 
achieve the above mentioned improved system performance, because Russell-Falla et al do 
provide the needed training of the network (column 3, lines 58-67) which would be needed to 
increase the statistical recognition needed to support the Bayesian network. 

-In regard to dependent claims 5 and 14, Russell-Falla et al further teach the step of 
training the neural network using a training set of web pages with respective known content 
types and collecting the statistics on the test results of the training web pages (column 3, lines 
58-67). 
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Response to Arguments 

9. Applicant's arguments with respect to claims 1 and 10 have been considered but are moot 
in view of the new ground(s) of rejection. 

-In general the Applicant argues that the combination of references utilized in the 
rejection of he independent claims fail to teach or suggest the newly amended limitations of 
"content types being exclusive of indicating formal language of the content" and "and indicating 
content type, said indicating being exclusive of indicating language in which content is written." 
The Examiner agrees, however the Examiner does believe that the newly applied prior art 
teaches the newly presented claimed features. 

Conclusion 

10. The prior art made of record and not relied upon is considered pertinent to applicant's 
disclosure. 

US-2006/0288015 12-2006 Schirripa et al. 

1 1 . Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Adam L. Basehoar whose telephone number is (571)-272-4121. 
The examiner can normally be reached on M-F: 7:00am - 4:00pm. 
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If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Steve Hong can be reached on (571) 272-4124. The fax phone number for the 
organization where this application or proceeding is assigned is 571-273-8300. 

Information regarding the status of an application may be obtained from the Patent 
Application Information Retrieval (PAIR) system. Status information for published applications 
may be obtained from either Private PAIR or Public PAIR. Status information for unpublished 
applications is available through Private PAIR only. For more information about the PAIR 
system, see http://pair-direct.uspto.gov. Should you have questions on access to the Private PAIR 
system, contact the Electronic Business Center (EBC) at 866-217-9197 (toll-free). 
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